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Nuclear Magnetic Resonance (NMR) spectroscopy provides a valuable tool by 
which one can control a spin ensemble. Control is achieved by using radio-frequency 
(RF) pulses. Pulse sequence design has been an active research area for many years. 
Recently, optimal control theory has been successfully applied to the design of pulse 
sequences, so as to minimize their total duration and improve their efficiency 
In this paper, we develop a new class of genetic algorithm that computationally 
determines efficient pulse sequences to implement a quantum gate U in a three-qubit 
system. The method is shown to be quite general, and the same algorithm can be 
used to derive efficient sequences for a variety of target matrices. We demonstrate 
this by implementing the inversion-on-equality gate efficiently when the spin-spin 
coupling constants J\2 = J23 = J and J13 = 0. We also propose new pulse sequences 
to implement the parity gate and fanout gate, which are about 50% more efficient 
than the previous best efforts. Moreover, these sequences are shown to require 
significantly less RF power for their implementation. 

The proposed algorithm introduces several new features in the conventional genetic 
algorithm framework. We use matrices instead of linear chains, and the columns of 
these matrices have a well defined hierarchy. The algorithm is a genetic algorithm 
coupled to a fast local optimizer, and is hence a hybrid GA. It shows fast convergence, 
and running on a MATLAB platform takes about 20 minutes on a standard personal 
computer to derive efficient pulse sequences for any target 8X8 matrix U. 
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I. 



INTRODUCTION 



In recent years, there has been considerable interest in formulating time optimal pulse 
sequences in NMR. Various efforts have focused on replacing traditionally well known se- 
quences (for example, sequences to transfer coherence between coupled spins in multidimen- 
sional NMR experiments pQ) by their time optimal counterparts. 

The advantages of time optimal sequences are many. By reducing the time required to 
perform a desired unitary operation, they reduce the impact of undesirable effects due to 
decoherence or relaxation. The efficiency in achieving the desired operation can be improved 
drastically (in some cases it can be doubled [2]). It is becoming clear that any serious at- 
tempts at quantum computing [3] using NMR would require such time optimal sequences 
at their foundation. 

The process of formulating time optimal sequences, like every other process of optimization, 
involves minimizing a " cost" function. The most widely used cost function for a coupled spin 
system is the time for evolution of the system under spin-spin J coupling [1,2]. Under the 
spin diffusion limit approximation, the time required to implement hard pulses is negligible 
compared to this time. 

The NMR Hamiltonian can be decomposed as [1] 



where drift term is the part of the Hamiltonian internal to the system, consisting of the 



changed (this is achieved by using hard pulses). In this paper, we shall deal with the three- 
spin problem, where the spin-spin coupling constants Ju = J23 = J, and J13 = [2] . In this 
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spin-spin coupling term. 



™ =1 VjHj, is the part of the Hamiltonian that can be externally 
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The unitary evolution of 3 interacting spin | particles is described by an element of SU(8), 
the group of unitary matrices [I] with determinant 1. The Lie algebra su(8) is a 63 
dimensional space whose basis operators are 8x8 skew-Hermitian matrices [2,4]. Clearly 
then, the NMR time-optimal problem in this case becomes an optimization problem in a 63 
dimensional space. 

Recent efforts have used geometric control theory, where the time optimal sequence is 
derived from the optimal trajectory (geodesic) [2J [5] to be traversed from the initial state to 
the desired final state in this 63 dimensional space. For example, trilinear propagators of 
the form U = exp(— idliahpls-y), where a, f3, 7 G {x,y, z}, have been implemented using 
geodesic sequences having total period of about half of traditional methods. 
There have also been pulses optimized using dynamic programming [6]. This is a so called 
"greedy hill climbing" optimization technique. Essentially, any optimization problem can 
be viewed as consisting of a fitness landscape, where the goal of optimization is to reach 
to highest hill(global optimization). Dynamic programming is a point to point technique 
where one traverses from one point of the landscape to another point having strictly higher 
fitness. This has been used to optimize coherence transfer in the presence of relaxation (the 
ROPE sequence) 

However, finding time-optimal sequences for a general 3 spin NMR problem has still 
remained unsolved. Here it is desired to form a desired 8x8 unitary operator using pulse 
sequences that require minimum period of evolution under the drift Hamiltionian Hd 
(spin-spin coupling). We seek to solve this problem in two steps: first given a unitary 
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operator U, we find many (if not all) possible ways of realizing this operator using 
state-of-art optimal sequences [2]. Then we find which of these sequences is the best. By 
automating this process and devising an algorithm to decompose the operator U using 
optimal sub-sequences, it is hoped that we can obtain more efficient sequences for any 8x8 
operator U. 

The main results of this paper are as follows: 

1. We formulate a new genetic algorithm that provides various possible sequences for a 
matrix U using cascaded time optimal sequences. This genetic algorithm is different 
from conventional implementations, and has added features of being a hybrid with a 
local-optimizer, and having an in-built hierarchy that makes it faster. 



2. We then use this algorithm to optimize the parity gate and fanout gate. The new 
sequences are found to 50% more efficient than exisitng sequences, besides requiring 
much less RF power for their implementation [HI HP] . In case of the U= invert on 
equality gate, we determine a new effienct sequence when the spin-spin coupling 
Ji 3 = and Jxi = 3 23 - 



II. PRODUCT OPERATOR BASIS IN NMR 

We will only be interested in the 3-spin NMR problem that forms the space 577(8). 
Consider the Pauli matrices defined by [TT] : 




They obey the well known commutation relations 

[I x , I y ] = il z ; [I y , I z ] = il x ; [I z , I x ) = il y (3) 

ll=I 2 y=ll = \± (4) 
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where 1 is the identity element. The product operator basis is an orthogonal basis iB s , 
which for an n-spin problem [s-u(2 n )] takes the form 

n 

Bs = 2 q - 1 Y[(Ika) ak3 

k=l 

where a = {x, y, z} and = 1 © • • • <S> I a - 

Hence for a 3-spin problem, there are 64 base operators including the unity operator 1. 
These can be conveniently put in the tabular form as in Fig 1 

This table is a way of representing the 63 base operators that form a basis (along with 
the identity operator 1) in su(8). The Lie algebra su(8) can be decomposed as 

su(8) = d © [ 2 © I 3 © pi © p 2 © p 3 © q (5) 

where 

ti = = span i{Iix,Ii y ,Iiz} 
l z = = span i{I 2x ,l2y,l2z} 
l 3 = = span i{I 3x , I 3y , I 3z } 

represent the subspaces that are spanned by the hard-pulses on the first, second and third 
spins respectively. Bilinear spin terms form the subspace p 1 © p 2 © p 3 , where 

p! = span i{2I lx I 2x , 2I lx I 2y , 2I lx I 2z , 2I ly I 2x , 2I ly I 2y , 2I ly I 2z , 2I lz I 2x , 2I lz I 2y , 2I lz I 2z , } 
p 2 = span i{2I 2x I 3x , • • • } 
p 3 = span i{2I 3 Ji x , • • • } 

where the • • • in the last two equations represent 8 other operators ( a total of 9) formed by 

cyclic permutations of {x,y,z}. 

The subspace q contains the trilinear terms 

q = span i{4Ii x I 2x I 3x , 4Ii x I 2x I 3y , 4Ii x I 2x I 3z , • • • } (6) 

where the • • • represents a total of 27 operators whose span forms q, and which are formed 
by the cyclic permutation of the indices {x, y, z} in the trilinear propagator. 
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FIG. 1: The panel illustrates the 63 base operators of the product operator basis. Along with the 
unity operator 1, they form a basis to describe any 8x8 matrix for a 3-spin problem. The separated 
boxes show the different subspaces, and it is possible to travel inside each subspace by using only 
hard pulses, which in our optimization problem has no cost. 



It is this decomposition of su(8) that table 1 represents. The key issue involved with 
such a decomposition is the immediate utility it has with regards to our optimization prob- 
lem. The table shows that are a total of 7 subspaces that form su(8), but we can move 
within each subspace by using only hard pulses [HE]- For example, any trilinear propagator 
exp^—iOIia^ph-y) where a,/3, 7 G {x,y,z} can be implemented from exp(—i9Ii z l2 Z hz) by 
using only hard pulses. In our optimization problem , hard pulses have no "cost", and 
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FIG. 2: The panel illustrates the refocusing method of implementing bilinear propagators, and the 
geodesic sequence for implementing trilinear propagators 

hence all 27 trilinear propagators are equivalent in their fitness. This reasoning is also true 
for the bilinear propagators, ie any propagator exp(—i9I na Iip) where a,/3 G {x,y,z} and 
n,l G {1,2,3}, can be implemented from exp(—i9I nz Ii z ) by using only hard pulses. Two 
examples of the above ideas are [U [2] : 

exp(-i9I lx I 2y hz) = exp (-*7^ij/) ex P ( i ^ I ' 2x ) eyi ^{-^hzhzhz) exp (-^^z) exp (i^hy 
exp(-i9I ly I 2y ) = exp (i^hxj exp \^h^j exp(-i9I ly I 2y ) exp \~^hxj exp (—v^h^j (7) 

In summary any unitary 8x8 matrix U can be decomposed using hard-pulses, bilinear 
propagators of form exp(—i9I na Iip) and trilinear propagators of form exp(—i9Ii a I 2 i3l3 7 ). 
As far as the cost during the optimization process is concerned, all hard pulses have zero 
cost PQ El El EE2], an bilinear propagators are equivalent and all trilinear propagators are 
equivalent (for the same phase angle 9). 



III. THE NMR TOOLBOX 



In literature, trilinear propagators have been implemented optimally using geodesic pulse 
sequences. Bilinear propagators are implemented by standard refocusing techniques, where 
one of the spins is decoupled from the system. These sequences are shown in fig 2. 

Given a target unitary matrix U, we now break the problem of finding time-optimal 
pulses sequences to realize U into 2 steps 

1. Devise an algorithm to decompose U in several ways, using only hard pulses, bilinear 
and trilinear propagators. For a 3 spin problem consisting of a 63- dimensional space, 
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there may be several ways to realizing the matrix U. Consider a N-step decomposition 
of U as 

U = exp^-iejna) exp(-i9 2 lipl m ry) exp(-i9 3 I p5 I qt I rv ) ■ ■ ■ (Nsteps) (8) 

where the greek letters a, /3 ■ • ■ G {x, y, z}, and roman letters n,l ■ ■ ■ G {1,2, 3}, and 
any of 9i,i G [1, N] may be 0. 

At the outset this problem seems complicated, because not only are the angles 
0i variable, but so also are the choice of propagator and their ordering (because 
succeeding propagators may not commute). However, we develop a hybrid genetic 
algorithm, that is surprisingly fast in converging to a solution, (sec 5). We sweep 
through the number of steps N from about 3 to 10 to find many (if not all) possible 
ways of decomposing the matrix U. 

2. We then simply choose the best sequence amongst these sequences, and this is likely 
to be the best way of achieving the target U . 

IV. GENETIC ALGORITHMS 

Genetic Algorithms (GAs) are a class of global optimization algorithms [131 HH US] first 
introduced by John Holland in the 70s, and have be applied to various areas from mechanical 
engineering to radio-astronomy. They are biologically inspired algorithms, and seek to mimic 
the process of evolution and the strategy of "survival of the fittest". The key components of 
a traditional GA are a collection of bit-strings [H] that form a "population". The bit strings 
are usually coded representations of the various candidate solutions to the problem at hand, 
and the coding scheme is flexible and differs from problem to problem. Usually the starting 
population is a random selection from the candidate space [13] . Then, mimicking the process 
of mating and evolution, pairs of strings are chosen from the population and crossed over. 
Crossing over entails exchanging the bit-strings after a certain bit-position called a locus 
[H] (this is randomly chosen). However, similar to the "survival of the fittest" adage, the 
member of the population that is fittest is more likely to cross-over (a member can cross over 
more than once). There is also another operation called a mutation, which involves flipping 
over a bit at a random location [TU [15]. Mutation and cross-over are competing operations, 
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FIG. 3: Cross over operation in a traditional genetic algorithm. Here cross over is taking place at 
a locus =6 
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FIG. 4: Mutation operation in a traditional genetic algorithm. Here mutation is taking place at a 
locus =6 

but the probability of mutation is usually kept low (about 10% chance of mutation and 
90% chance of cross-over). If this process is continued over several reporduction cycles 
(generations), it is found that the population becomes fitter and fitter [13], until finally the 
optimum is reached. Fig 3 shows the tradional cross-over and mutation operations, and a 
flow-chart implementation of a simple GA. 

The main advantage of GAs are that they are a parallel search technique |13j . where all 
the members of the population search the candidate space simultaneously. Moreover, unlike 
point-to-point optimization methods like dynamic programming, where we always seek 
points of higher fitness, genetic algorithms allocate in a principled way a limited number of 
trials to solutions that are known to be inferior. It is this diversity, that potentially leads 
to the GA approaching the global optimum where other methods may fail. However, on 
the flip side, due to their high reliance on randomness and diversity, GAs are usually slow 
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to converge to a solution. 

V. NEW HYBRID HIERARCHICAL CLASS OF GENETIC ALGORITHM 

In our problem, we want to formulate a GA that provides several (if not all) possible 
decompositions of a target matrix U into hard pulses, bilinear and trilinear propagators. 
Because of the large number of variables, and problems due to non-commutativity, conver- 
gence of the GA in a reasonable time- frame would become difficult. Hence we break away 
from the traditional framework of the GA, and introduce new features that respect the ba- 
sis of genetic algorithms (parallel search based on fitter members crossing over more), but 
improve convergence: 

1 . We use matrices instead of bit strings 

2. We use digits 0-9 instead of restricting to only or 1. 

3. Our encoding scheme is as follows. The matrix consists of 4 columns and a variable 
number of rows. The number of rows specify the number of steps (N) we want to 
decompose the matrix U into. 

• The first column represents the subspace from which to pick a propagator (in 
table 1), note that this can go from 1-9. 

• The second column represents the propagator inside that subspace. This corre- 
sponds to the number of the square inside the subspace in fig 1. 

For example if a row has as first two elements 4 and 7, then it would correspond- 
ing to the propagator in the 7th square in the 4th subspace in fig 1, and this is 
exp(—i9l2 Z l3y)- For the subspaces 1,3,5 in fig 1 where there are only 3 propaga- 
tors, we make them triplicate, ie numbers 1-3, 4-6, and 7-9 in the second column 
would all be the same. This is to standardize the encoding scheme to have 9 
digits. 
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• The third and fourth columns represent the angle of rotation 9 of the selected 
propagator. The angle is computed as: 

9 = (3rd column number) x 45° + (4th column number) x 5° (9) 

4. Hence there is a certain hierarchy built into this encoding scheme. A change in 
number in the 4th column is unlikely to affect the solution as much as a change in 
the number in the 3rd column. 



5. This encoding scheme ensures that the candidate solution converges to within a 5° 
accuracy in each of the 9i,i G [1, A]. We then use a local optimizer, (which can be 
a modified genetic algorithm, or a dynamic programming algorithm) to approach to 
within an arbitrary accuracy (upto 0.1°). It is thus we call the algorithm a hybrid 
genetic algorithm, where we couple the inherent global optimization advantages of a 
GA with the fast convergence of a local-optimizer. 



A. Examples of Encoding Scheme 

To make things clear, consider two 4x4 matrices in Fig that are members of a population 
at some time. From Fig 1 and using our encoding scheme they correspond to the matrix 
decomposition: 

A = exp I -i— I lz 1 exp I -i— -h z I 2y h y 

( 285vr T \ ( 175tt T \ 
x exp y-i-^-hxhz I exp \-i-^-h x hxhy J (10) 

/ 435vr T \ / 280tt T T 
B = exp I -i-^hxhyhz J exp I -i^^-hxhxhy 

( .3007T \ / .1907T \ 

x ex P ( -i^^hzhy I exp l-t-^-hzhxhz I (11) 



B. Crossover and Mutation operations 

Traditionally, crossover involves exchanging 2 chosen bit strings at a randomly chosen 
locus (cross-over point), and repeating this process till a new population is created. In 
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FIG. 5: Two examples of the encoding scheme. The sequences they correspond to are given below. 

our scheme, crossover involves exchange of subblocks between 2 matrices (members of the 
population). Since we must respect the hierarchy between columns, we introduce 4 different 
kinds of crossover operations: 

1. Cross 1 : Involves cross over of sub-blocks of size n x 4 where n G [1, N], and N is the 
total number of rows of each member. 

2. Cross 2 : Involves cross over of sub-blocks of size n x 3 where n G [1, N], and the first 
column is not involved in cross-over. This is a means of respecting hierarchy, because 
the first column is more important than the second, which in turn is more important 
than the third etc. Additionally, at the end of this operation, with a small probability 
(10%), a mutation can be introduced in the first column. In thise random 
entry is chosen in the first column and is randomized in 1-9. 

3. Cross 3 : Involves cross over of sub-blocks of size n x 2 where n G [1, N], and the first 
two columns are not involved in cross-over. With a probability of 7%, a mutation can 
be introduced in the first two columns. 

4. Cross 4 '■ Involves cross over of sub-blocks of size n x 1 where n G [1, N], and the first 
three columns are not involved in cross-over. With a probability of 50%, a mutation 
can be introduced in the first three columns. 

5. Flip : This is an operation that seeks to address problems due to non-commutativity 
of successive propagators, and hence to improve convergence to a valid decomposition. 
This is a unary operation, and involves only one member of the population (unlike 
cross-over that occurs between two members). Here, two rows are randomly selected 
and flipped over (exchanged). This can be done more than once. This process is 
repeated to half the total members of the population . 
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FIG. 6: Crossover operations in our genetic algorithm, applied to the two example population 
members A and B above. There are 4 kinds of cross-over operations and 1 flip operation to 
account for non-commutativity amongst succeeding propagators. The CROSS operations involve 
two members, while the FLIP operation requires only one member. 

6. Mutate : Genetic algorithms traditionally have a tendency to get trapped at local 
optima. This is because a certain member of the population may be fitter than the 
remaining (but not the fittest possible), and may reproduce widely, leading to all 
members having almost the same characteristics, and the GA getting "trapped". To 
break out of this scenario, we introduce the mutation operation after the 30th gener- 
ation. The probability of mutation increases linearly from in the 30th generation to 
about 0.35 in the 50th generation (we run upto 50 generations). In this operation, 10 
members of the population (usually population size is taken between 500 and 1000) 
are chosen, and all their entires are randomized between 0-9. 
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C. Fitness criteria 



Most optimization algorithms including GAs work best when the entire fitness of a mem- 
ber can be encapsulated into one parameter or expression. Multi-parameter optimization 
techniques are usually difficult to implement and slow to converge. However, the choice of 
the fitness parameter must be such that a gradual increase in fitness will lead to a closer 
approach to an optimum solution [13] . Moreover, the fitness landscape (the pictorial rep- 
resentation of various candidate solutions as hills and vallies) must have one clear global 
optimum and not too many local optima. GAs tend to get trapped in local optima [I3J US] , 
and a fitness function with many vallies of the same depth may lead to the GA getting 
trapped in one of them without finding the deepest valley (global optimum). 
We used three kinds of fitness functions depending on the target matrix U. Let G be the 
candidate solution (one member of the population), and U be the target 

1 
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where the sum in F 2 and F3 indicates the sum over all 64 elements of the matrix \G*U — WU\ 
and \G — U\. Function F\ works best when the target U is sparse, and this is usually true 
for all quantum gates of interest. However, when U is diagonal and many elements are 1, it 
tends to fall into a local optimum trap of G = lsxs ie the unity operator. For example in 
implementing the A 2 (I Z ) gate [2,18,19] 
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the solution employing F\ tends to get trapped into lsxs- 

In such cases F 2 is found to work better. F 3 works best when the matrix U is not sparse. 
This has limited application in most useful quantum gates, but it can be used in running a 
local-optimizer. 

D. Details of convergence and algorithm parameters 

We worked with population sizes ranging from 500 to 1000. Larger population sizes lead 
to better convergence to the solution but take longer time. A single run of 50 generations 
takes about 5-7 minutes on a 2.6GHz AMD personal computer using MATLAB. The rel- 
atively short times involved, and considering that we use only a standard PC make this 
method attractive. The run process is as follows: given the matrix U, we fix the number of 
rows of the population members as 3 to start with. The entries of the population are initially 
taken to be random numbers from 0-9. Then we run over 50 generations, and if U can be 
decomposed using only 3 steps, the solution usually appears as the member having highest 
fitness at the end of the run. To explore if there are any other decompositions of U using 
only 4 steps, the process can be repeated. However, there is no guarantee the solution may 
appear, as the algorithm may tend to converge to one of the solutions more predominantly. 
The number of rows is increased to 4, and the whole process is repeated, again to 50 gener- 
ations. By repeating this and sweeping the number of rows (this corresponds to the number 
of steps N we seek to decompose U into), from 3-10, we obtain various possible decomposi- 
tions of U. 

However, it is not ensured that all decompositions of U appear as a result of this algorithm. 
But most target matrices of interest cannot be decomposed in too many different ways, and 
the algorithm still yields useful and more efficient pulse sequences than those currently being 
employed. 

E. Local Optimizer 

If an exact solution is found, then the fitness according to (15) should be infinite. How- 
ever, since we are doing calculations in MATLAB using fixed point arthimetic, it is usually 
about 10 15 . Since the accuracy of the 6i in the above GA is only to 5°, the best solutions 
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usually have fitness of about 1000. To approach the exact solution from here, we use a local 
optimizer. This can be done in two ways: 

1. Use a "greedy hill climbing" algorithm that sweeps each 0i in the range ±10°, and 
finds the best solution. 

2. Use another genetic algorithm where to all the Qi are added random numbers in the 
range ±10°. By using a large population size (2000), and using fitness function F 3 , 
and only employing cross-over operation CROSS-4 but allowing the fourth column 
entries to be floating point numbers, we find a solution to within about 0.1° accuracy 
in 5 minutes. 



In this section, we present some results of our algorithm in finding efficient sequences 
for the inversion on equality gate, parity gate and fanout gate, and compare them with 
traditional sequences. 



Inversion-on-equality (J = ) gate inverts the state when the states of all the qubits are 
equal. In a 3-qubit system it act as [H] 



VI. RESULTS 



A. Inversion on Equality Gate 




(16) 
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FIG. 7: The traditional implementation of the inversion on equality gate when Ji2,^23 and J13 
exist. Here t\ = 1/2 J12, T2 = I/2J23 and T3 = I/2J13. 

The unitary operator for implementation of this gate is of the form 



/ 


-1 




















o\ 







1 




























1 




























1 































1 






















1 































1 





V 























-v 



(17) 



Applying 7= on a three qubit system, when one of the qubits is in state |1 >, results in 
controlled-Z gate in the other two qubits. Such gates are used in the implementation of 
universal CNOT gates. The well known sequence for this is jH [10] 

U I= = exp {-i2nJ 12 TiIi z I 2z ) exp {-i2n htthzhz) exp (-i2nJ 3 iT 3 I lz I 3z ) (18) 
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where t± = 1/2 Ji 2 , t 2 = 1/2 J23 and r 3 = 1/2 J13. Our program yields this same sequence 
as the best possible. However, this is in the case when all three coupling constants J12, J23 
and Ji 3 exist. What will be the optimal sequence in oft-considered case when J 13 = and 
J12 — J23 — 



The advantage of automating the process is that it just requires a small change in the code. 
We replace all the bilinear propagators exp(—i9 hahp) with the the combined propagator 
of form exp {—iO(Ii z I 2z + I^z^z))- This encapsulates the fact that J13 = 0, as the bilinear 
propagators involving Ii and ^3 will not be implementable. Our program leads in this case 
the sequence 

U I= = exp (-^l) ex P {-iZnhzhxhy) exp {-m{I lz I 2z + hzhz)) 

x exp (-i2irI lz I 2y I 3x ) exp (-iirl 2y l 3y ) (19) 

This requires a total period of r = 2.73/ J. We believe this is the best way of realizing Uj = 
in this case. 



B. Parity Gate 



Parity gate adds (addition modulo 2) the control bits to the target bit [THl HZ]. The 
parity gate on a 3-qubit system where third qubit is the target qubit and the first two qubits 
are control, has the form 

/l o\ 
1 
1 
1 
1 
1 
1 
\0 0000001/ 

The conventional implementation uses a two-qubit U 2 gate which is of the form [91 [10] 
U 2 = e ~ i27r ( J/lz/2z V where r = 1/2 J. W is of the same form but with r = 3/2 J. Besides U 2 



(20) 
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FIG. 8: The traditional implementation of the parity gate requiring a total time period of r = 2.5/ J. 
Here n = l/2Ji 2 , r 2 = 1/2J 23 and r 3 = 1/2J 13 . 



and pseudo-Hadamard gates, the other gates used are phase gate and CNOT gate given by 
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% 



(21) 



CNOT 



(22) 



/l \ 

10 

1 

\0 1 J 

The sequence and circuit are shown in fig 4. The total time required is r = 2.5/ J. 
Using our improved genetic algorithm, we obtain the following decomposition for the parity 
gate 
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FIG. 9: The traditional implementation of the fanout gate requiring a total time period of r 
2.5/ J. Here n = 1/2 Ji 2 , r 2 = 1/2J 23 and r 3 = 1/2J 13 . 



P = exp ex P (^ _i Y /3x ) 6XP ^ nIlzl2z ^ exp {-^hzhzhx) 



(23) 



requiring a total period of 1.366/ J, which is 54% faster than the conventional sequence 
[§]. Moreover, this sequence is found to require only about 25% of the RF power of the 
conventional sequence. 



C. Fanout Gate 



Fanout gate adds (addition modulo 2) a control bit onto n target bits [TBI CEO HE] -The 
fanout gate on a 3-qubit system where first qubit is the control qubit and other two qubits 
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are target is shown below. The (classical) value of control bit is copied or fanned out to 
the target bits if the target bits are initially in |0 > state. However, if the control bit is 
in coherent superposition, the fanout gate creates entangled states. A (n+1) fanout gate is 
conventionally built out of (n+1) parity gate by applying Hadamard gates on both sides of 
the parity gate on the first n-qubits [9j ITU] 

/l o o o\ 
1 
1 
1 
1 
1 
1 
\0 0001000/ 

The conventional sequence requires a time period of r = 2. 5/ J. Our algorithm yields 

F = exp ex P {i^hxhx) exp {-ilirhzhxhx) exp 

requiring r = 1.366/ J, which is 54% faster than the conventional period. 



(24) 
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VII. CONCLUSION 



In this paper, we have developed a new algorithmic way of decomposing a target matrix 
U into hard pulses, bilinear and trilinear propagators. By automating this process and 
finding various possible decompositions of a matrix, we show that one can determine more 
time efficient pulse sequences to realize the same gate U for a 3-spin NMR problem. This 
is then applied to determining efficient pulse sequences for the invert-on-equality gate Ui = , 
the parity gate P and fanout gate F. In the last two cases the sequences are about 50% 
faster than conventional sequences, and require lesser RF power for their implementation. 
Although this is a marginal improvement, it is interesting that the same algorithm is able 
to yield both sequences by only changing the target matrix U. We foresee the use of such 
an algorithmic technique when either the pulse sequence for a gate is unknown (as in Uj = , 
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or if it is too difficult to intuitively determine. 
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VIII. APPENDIXES 

We present here the flow-charts describing the traditional genetic algorithms, and our 
improved genetic algorithm. 
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FIG. 10: The traditional genetic algorithm employing bit-strings and single crossover and mutation 
operations. For definiteness, we fix the population size as 100, bit-string length as 8 bits and the 
number of reproduction cycles (generations) as 30 in this flowchart. 
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FIG. 11: The improved genetic algorithm employing matrices and a mutation and 5 types of 
crossover operations. We typically used population sizes as 500, and number of generations as 50. 
This flowchart illustrates the case when N = 4. The program is swept from N = 4 to N = 10. 




FIG. 12: The generic crossover operation CROSS n where n G [1,4]. 




FIG. 13: The FLIP operation. This is introduced to account for non-commutativity of suceeding 
rows (propagators) in the member matrices. 



